Picture for Deepti Ghadiyaram

Deepti Ghadiyaram

Facebook AI

Swift Sampling: Selecting Temporal Surprises via Taylor Series

Add code
May 21, 2026
Viaarxiv icon

FAGER: Factually Grounded Evaluation and Refinement of Text-to-Image Models

Add code
May 18, 2026
Viaarxiv icon

FuTCR: Future-Targeted Contrast and Repulsion for Continual Panoptic Segmentation

Add code
May 12, 2026
Viaarxiv icon

A Systematic Study of Cross-Modal Typographic Attacks on Audio-Visual Reasoning

Add code
Apr 05, 2026
Viaarxiv icon

Semantic Richness or Geometric Reasoning? The Fragility of VLM's Visual Invariance

Add code
Apr 02, 2026
Viaarxiv icon

Seeing Isn't Orienting: A Cognitively Grounded Benchmark Reveals Systematic Orientation Failures in MLLMs Supplementary

Add code
Mar 12, 2026
Viaarxiv icon

DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers

Add code
Feb 19, 2026
Viaarxiv icon

Right Side Up? Disentangling Orientation Understanding in MLLMs with Fine-grained Multi-axis Perception Tasks

Add code
May 29, 2025
Viaarxiv icon

Improving Physical Object State Representation in Text-to-Image Generative Systems

Add code
May 04, 2025
Figure 1 for Improving Physical Object State Representation in Text-to-Image Generative Systems
Figure 2 for Improving Physical Object State Representation in Text-to-Image Generative Systems
Figure 3 for Improving Physical Object State Representation in Text-to-Image Generative Systems
Figure 4 for Improving Physical Object State Representation in Text-to-Image Generative Systems
Viaarxiv icon

What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization

Add code
Mar 09, 2025
Figure 1 for What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Figure 2 for What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Figure 3 for What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Figure 4 for What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization
Viaarxiv icon